Fundamental Frequency (F0) Fusion Transformation-Based on BLSTM for Voice Conversion
نویسندگان
چکیده
منابع مشابه
F0 transformation within the voice conversion framework
In this paper, several experiments on F0 transformation within the voice conversion framework are presented. The conversion system is based on a probabilistic transformation of line spectral frequencies and residual prediction. Three probabilistic methods of instantaneous F0 transformation are described and compared. Moreover, a new modification of inter-speaker residual prediction is proposed ...
متن کاملSpeaker-independent HMM-based voice conversion using quantized fundamental frequency
This paper proposes a segment-based voice conversion technique between arbitrary speakers with a small amount of training data. In the proposed technique, an input speech utterance of source speaker is decoded into phonetic and prosodic symbol sequences, and then the converted speech is generated from the pre-trained target speaker’s HMM using the decoded information. To reduce the required amo...
متن کاملVoice conversion based on parameter transformation
This paper describes a voice conversion system based on parameter transformation [1]. Voice conversion is the process of making one person’s voice “source” sound like another person’s voice “target”[2]. We will present a voice conversion scheme consisting of three stages. First an analysis is performed on the natural speech to obtain the acoustical parameters. These parameters will be voiced an...
متن کاملHMM-based robust voice conversion using adaptive F0 quantization
This paper proposes an HMM-based voice conversion (VC) technique with quantized F0 symbol context using adaptive F0 quantization. In the HMM-based VC, an input utterance of a source speaker is decoded into phonetic and prosodic symbol sequences, and the converted speech is generated using the decoded information from the pre-trained target speaker’s phonetically and prosodically context-depende...
متن کاملText-independent F0 transformation with non-parallel data for voice conversion
In voice conversion, a simple frame-level mean and variance normalization is typically used for fundamental frequency (F0) transformation, which is text-independent and requires no parallel training data. Some advanced methods transform pitch contours instead, but require either parallel training data or syllabic annotations. We propose a method which retains the simplicity and text-independenc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Science Discovery
سال: 2018
ISSN: 2331-0642
DOI: 10.11648/j.sd.20180604.21